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Amendments to the Claims : 
This Usting of claims replaces all prior versions and listings of claims in the apphcation: 

Listing of Claims : 

1 . (currently amended) A method executed by a computer under the control of a 
program, said computer including a memory for storing said program, said method comprising 
the steps of: 

r e c e iving at l e ast on e prot e in backbon e structur e providing an ensemble of related 
backbone structures : 

applying a protein design algorithm to generate a protein sequence or set of protein 
sequences and structur e: 

sampling and evaluating one or more amino acids m4 or rotamers within the context of 
said protein sequence or set of protein sequences and at least one backbone structure; and 

generating a probability matrix for said amino acids md or rotamers that represent the a 
viable sequence space for said prot e in ensemble of backbone structures . 

2. (original) A method according to claim 1 further comprising the step of: 
generating a single protein sequence from said probability matrix. 

3. (original) A method according to claim 1 further comprising the step of: 
generating a combinatorial library of proteins from said probability matrix. 

4. (original) A method according to claim 1 wherein said steps are repeated more 
than once to generate said probability matrix 
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5. (original) A method according to claim 1 wherein said protein design algorithm 
comprises an optimization procedure selected from the group of: dead end elimination algorithm; 
genetic algorithm; Monte Carlo algorithm; and self consistent mean field theory algorithm or 
combinations thereof 

6. (currently amended) A method according to claim 1 wherein said protein 
backbon e structur e is talc e n from at least one backbone structure of the ensemble is derived from 
the structure of a natural protein. 

7. (currently amended) A method according to claim 1 wherein said prot e in 
structur e at least one backbone structure of the ensemble is generated by comparative modeling. 

8. (currently amended) A computer-executable method according to claim 1 
wher e in th e comprising: 

receiving at least one protein backbone structure: 

generating at least two probability matrices that each represent the viable sequence space 
for the protein backbone subject to a constraint 

combining information from said probability matrices is combin e d to satisfy at least two 
constraints on sequence space. 

9. (canceled) 

10. (canceled) 

1 1 . (canceled) 

12. (currently amended) A method according to claim 9 8 wherein said steps are 
repeated more than once to generate said probability matrix. 
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13. (currently amended) A method according to claim 9- 8 wherein said generating 
step comprises a p rotein design algorithm that comprises an optimization procedure selected 
from the group of: dead end elimination algorithm; genetic algorithm; Monte Carlo algorithm; 
and self consistent mean field theory algorithm or combinations thereof. 

14. (currently amended) A method according to claim 9 I wherein said ensemble of 
related prot e in backbone structures ar e talc e n from comprises backbone structures of a family of 
natural proteins. 

15. (currently amended) A method according to claim 9 i wherein said ensemble of 
related backbone structures is derived from an NMR structure. 

16. (currently amended) A method according to claim 9 I wherein said ensemble of 
related protein backbone structures is generated by a Monte Carlo simulation. 

17. (currently amended) A method according to claim 91 wherein said ensemble of 
related protein backbone structures is generated by a molecular dynamics simulation. 

18. (currently amended) A method according to claim 9i wherein the information 
from at least two probability matrices is combined to satisfy at least two constraints on sequence 
space. 
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19. (currently amended) A method execut e d by a comput e r und e r the control of a 
program, said comput e r including a m e mory for storing said program, said m e thod comprising 
the computer-executable steps of: 

receiving at least one compl e te protein sequence and structure; 

sampling and evaluating on e or mor e amino acids and rotam e rs within th e cont e xt of said 
prot e in s e qu e nc e and structur e: and 

generating a probability matrix for said amino acids and rotamers that represent the 
viable sequence space for said protein backbone by evaluating fitness of one or more amino 
acids within the context of said protein sequence and structure . 

20. (currently amended) A method according to claim 19 wherein said protein 
sequence an and structure is correspond to that of a natural protein. 

2 1 . (currently amended) A method according to claim 1 9 wherein said prot e in 
s e qu e nc e and structur e compris e s an ensemble of related protein structures is generated from 
said received protein sequence and structure . 

22. (original) A method according to claim 21 wherein said ensemble of proteins is 
generated by a Monte Carlo simulation. 

23. (original) A method according to claim 21 wherein said ensemble of proteins is 
generated by a molecular dynamics simulation. 

24. (original) A method according to claim 19 wherein said steps are repeated more 
than once to generate said probability matrix. 

25. (currently amended) A method according to claim 19 further comprising the step 
of: g e n e rating producing a single protein sequence from said probability matrix. 
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26. (currently amended) A method according to claim 19 further comprising the step 
of: g e n e rating producing a library of proteins from said probability matrix. 

27. (original) A method according to claim 19 wherein said protein sequence and 
structure is generated by comparative modeling. 

28. (original) A method according to claim 19 wherein said protein sequence and 
structure is taken from a natural protein. 

29. (original) A method according to claim 19 wherein the information from at least 
two probabihty matrices is combined to satisfy at least two constraints on sequence space. 

30. (original) A method for optimizing simulation or scoring function parameters that 
utilizes comparisons between designed sequences and natural sequences, comprising the steps of: 
designing a protein sequence; comparing said designed protein sequence to natural protein 
statistics; modifying said simulation or scoring function parameters consistent with said 

' comparison. 

31 . (original) A method according to claim 30 wherein said steps are repeated at least 

once. 

32. (original) A method according to claim 30 wherein said natural protein statistics 
are in the form of a position specific scoring matrix. 

33. (original) A method according to claim 30 wherein said natural protein statistics 
are in the form of amino acid composition. 
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34. (original) A method for optimizing simulation or scoring function parameters that 
utilizes comparisons between designed sequences and natural sequences, comprising the steps of: 
calculating an amino acid probability matrix; comparing said matrix to natural protein statistics; 
modifying simulation or scoring function parameters consistent with said comparison. 

35. (original) A method according to claim 34 wherein the sequence of steps is 
repeated at least once. 

36. (original) A method according to claim 34 wherein said natural sequence 
statistics are in the form of a position specific scoring matrix. 

37. (original) A method according to claim 34 wherein said natural sequence 
statistics are in the form of amino acid composition. 

38. (new) A method according claim 25 wherein the amino acid sequence of the 
single protein sequence is selected by identifying the amino acid with the lowest free energy at 
each position. 

39. (new) A method according claim 26 wherein the library is designed by a 
procedure comprising selecting an upper limit on free energy, allowing amino acid variations 
among amino acids that are below the upper free energy limit. 

40. (new) A method according claim 26 wherein the library is designed by a 
procedure comprising incorporating amino acids at incrementally lower probabiUties until a 
desired complexity is achieved. 
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41 . (new) A method according to claim 8 wherein the at least two constraints 
comprise a first constraint corresponding to a first structural form and second constraint 
corresponding to a second structural form that is distinct fi"om the first structural form. 

42. (new) A method according to claim 8 wherein a first and second probability 
matrix are combined by adding or subtracting firee energies values fi-om said probability 
matrices. 

43. (new) A method according to claim 8 wherein the combining process is iterated. 

44. (new) The method of claim 1 wherein the sampling comprises freezing side chain 
identities and rotamers at all other positions in the protein. 

45. (new) The method of claim 1 wherein the probability matrix is expressed as a set 
of partition fimctions. 

46. (new) The method of claim 1 wherein the probability matrix is expressed as a firee 
• energy value. 

47. (new) The method of claim 1 wherein the probability matrix comprises 
information for all twenty amino acids. 

48. (new) The method of claim 3 fiirther comprising screening or selecting one or 
more proteins fi*om the library. 

49. (new) The method of claim 4 wherein, in a subsequent cycle, the protein design 
algorithm uses the probability matrix fi^om a previous cycle. 
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50. (new) The method of claim 48 wherein the screening or selecting comprises 
identifying a protein with enhanced activity, improved stability, or altered specificity. 

5 1 . (new) A method comprising: 

generating a first probability matrix for amino acids at positions in a protein sequence by 
evaluating fitness of rotamers of a plurality of amino acids within the context of the protein 
sequence and a first constraint on sequence space; 

generating a second probability matrix for amino acids at positions in the protein 
sequence by evaluating fitness of rotamers of a plurality of amino acids within the context of the 
protein sequence and a second constraint on sequence space; and 

combining information fi"om at least the first and second probability matrices to satisfy at 
least the first and second constraints on sequence space. 

52. (new) The method of claim 51 wherein the first and second constraints correspond 
to two distinct structural forms. 

53. (new) The method of claim 51 wherein the information is combined by adding or 
' subtracting fi-ee energies. 

54. (new) The method of claim 51 wherein the information is combined iteratively. 

55. (new) The method of claim 54 wherein the information is combined iteratively 
until convergence is attained. 

56. (new) The method of claim 51 fiorther comprising, prior to generating the first and 
second probability matrix, applying a protein design algorithm to generate a protein sequence for 
a protein backbone structure. 
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57. (new) A method comprising: 

for each of an ensemble of related protein structures, identifying a set of side chain 
identities and rotamer orientations suitable for the respective protein structure; 

updating a matrix representing fitness of amino acid rotamers at positions in the protein 
by sampUng rotamer fitness in the context of each identified set of side chain identities and 
rotamer orientations; and 

repeating the step of identifying at least once, wherein a subsequent step of identifying 
uses the updated matrix to identify set of side chain identities and rotamer orientations suitable 
for the respective protein structure. 

58. (new) An article of computer- accessible memory that has instructions for directing a 
computer to execute a method comprising: 

for each structure of an ensemble of related protein structures, identifying a set of side 
chain identities and rotamer orientations suitable for the respective protein structure; 

updating a matrix representing fitness of amino acid rotamers at positions in the protein 
structures by sampUng rotamer fitness in the context of each identified set of side chain identities 
and rotamer orientations; and 

repeating the step of identifying at least once, wherein a subsequent step of identifying 
uses the updated matrix to identify set of side chain identities and rotamer orientations suitable 
for the respective protein structure. 

59. (new) A computing apparatus comprising: a central processor, a memory, and an 
input/output bus, 

the memory being configured to store a matrix representing fitness of amino acid 
rotamers at a plurality of positions in a model of a protein, 

the processor being configured to execute a method comprising: 
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for each structure of an ensemble of related protein structures, identifying a set of 
side chain identities and rotamer orientations suitable for the respective protein structure; 

updating the matrix stored in the memory by samphng rotamer fitness in the 
context of each identified set of side chain identities and rotamer orientations; and 

repeating the step of identifying at least once, wherein a subsequent step of 
identifying uses the updated matrix in the memory to identify set of side chain identities 
and rotamer orientations suitable for the respective protein structure. 

60. (new) A method of providing a protein, the method comprising: 
performing the method of claim 1 ; and 

producing at least one protein comprising a sequence based on said probability 



61 . (new) A method of providing a protein library, the method comprising: 

performing the method of claim 1 ; and 

producing a library of proteins that include proteins that each comprise a sequence 
based on said probability matrix. 

62. (new) A library of proteins comprising a plurality of proteins designed by the method 
of claim 3. 

63. (new) A protein comprising an amino acid sequence designed by the method of 
claim 2. 

64. (new) A method executed by a computer under the control of a program, said 
computer including a memory for storing the program, the method comprising: 

providing an ensemble of related backbone structures; and 

applying a design procedure to generate a protein sequence or set of protein sequences. 



matrix. 
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65. (new) A method executed by a computer under the control of a program, said 
computer including a memory for storing the program, the method comprising: 

providing an ensemble of related backbone structures; and 

generating a matrix of amino acid probabilities that represents a viable sequence space 
for said ensemble of backbone structures. 

66. (new) A method comprising the computer-executable steps of: 
receiving at least one complete protein sequence and structure; 

sampling and evaluating one or more amino acids and rotamers within the context of said 
complete protein sequence and structure; and 

generating a probability matrix for said amino acids and rotamers that represent the 
viable sequence space for said protein structure. 



